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BACKGROUND OF THEv INVENTION 



Field of the Invention 

The present invention generally relates to the 
field of intellectual\property audit systems, and more 
particularly, to the field of intellectual property 
systems which collects pertinent data regarding an 
intellectual property portfolio and analyzes the 
collected data against empirical data to provide a 
qualitative and/or quantitative analysis of the 
intellectual property portfolio. 



Description of the Related Art 

In general, purchasers ofy assets which are 
intellectual property intensive^ typically purchase 
these assets based upon some estimated value which, of 
course, begins with an offer for\sale. When a creditor 
is considering advancing funds ba^ed upon the value of 
fixed assets, such as equipment, an appraisal is 
performed and a liquidation value is determined. Then 
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1 a liquidity adjustment is considered and a liquidation 

2 value is concluded. The same valuation approaches can 

3 be employed to determine the liquidation value of 

4 intellectual property. It is known to value 

5 intellectual property assets with respect to various 

6 accounting procedures which conform to Generally 

7 Accepted Accounting Procedures (GAAP) . There are 

8 typically three such procedures: cost, market and 

9 income approaches. 

10 The cost approach gives consideration to the cost 

11 to reproduce or replace the subject intellectual 

12 property. For example, for patent intellectual 

13 property, this approach would consider the cost 

14 associated with research, engineering, design and 

15 testing activities. In trademark cases the advertising 

16 expenses that would be needed to create a trademark of 

17 similar prominence would be considered. 

18 From this amount an allowance is deducted to 

19 reflect all forms of depreciation or obsolescence 

20 present, whether it arises from physical, functional or 

21 economic causes. Physical deterioration (depreciation) 

22 is the loss in value resulting from wear and tear from 

23 operations and exposure to the elements. Functional 

24 obsolescence is the loss in value within the property 

25 as a result of such things as changes in design, 

26 materials, or processes, overcapacity, inadequacy, 

27 excess construction, lack of utility, and excess 

28 operating cost. Economic obsolescence is the loss in 

29 value that results from influences external to tJhe 

30 property such as the general state of economy, the 

31 effect of governmental regulations, and the like. A 
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summary of the cost approach is presented below. 
Replacement cost 

Less: Physical Depreciation 

Less: Functional Obsolescence 



Less : 



Economic Obsolescence 



Equals: Fair Market Value 

Physical and functional obsolescence is not 
usually an important factor when valuing intellectual 
property but care is needed to consider the economic 
obsolescence that can be introduced by outside forces. 

Unfortunately the cost to develop intellectual 
property rarely bears any relationship tot he economic 
earning power of the property or the value of the 
property . 

The market approach gives consideration to prices 
paid for similar property in arm's length transactions. 
Adjustments can be made, if necessary, to the indicated 
market prices to reflect the condition and utility of 
the property being appraised relative to the market 
comparative. This approach is applicable where there 
is an active market with a sufficient quantity of 
reliable and verifiable data. Usually, similar 
property that exchanged between independent parties for 
which price data is disclosed is impossible to find for 
intellectual properties. The activities of the 
Resolution Trust Corporation are however beginning to 
provide some of the previously missing data. At 
present, the market approach is difficult to implement 
for intellectual property. 

The income approach, by default, is still the most 
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1 preferred method. It considers the present value of 

2 the prospective economic benefits of owing the 

3 appraised property. This involves a capitalization of 

4 the forecasted income stream with consideration given 

5 to the duration of the income and the risks related to 

6 its achievement. 

7 Care must be employed to assure that economic 

8 benefits derived from the intellectual property are 

9 isolated from the contribution to earnings derived from 
^10 the complementary assets of the business. When 

mil properly done, the income approach can provide an 

iJ 12 accurate indication of the fair market value of 

u\13 intellectual property. Once the fair market value of 

O 14 the intellectual property portfolio has been 

¥ ~ 15 determined, then as indicated above, the fair market 

□ 16 value is adjusted according to conventional methods 

^ 17 which consider effects such as amount of time required 

id 18 to dispose of the portfolio, market evidence of similar 

p 19 intellectual property portfolios sold in liquidation 

20 and cost to liquidate the property. 

21 These accepted accounting methods rely or function 

22 on the availability of sufficient data relating to the 

23 intellectual property portfolio itself. Thus, in this 

24 situation, the seller of the intellectual property 

25 portfolio typically has used and marketed the 

26 intellectual property over a sufficiently long time 

27 period that suitable data has been collected to 

28 formulate a price based upon one of the above 

29 accounting valuation techniques. However, these 

3 0 accounting techniques typically do not provide reliable 

31 and/or dependable valuation results when the seller of 
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the intellectual property portfolio has not collected 
data or has not used or marketed the portfolio long 
enough to obtain such data. 

In addition, for typical purchases of intellectual 
property assets, there is typically unavailable an 
independent indicator of the worth of the intellectual 
property to be sold. The independent indicator which 
is lacking may be either a qualitative or quantitative 
indicator of the worth of the intellectual property 
portfolio. 

Accordingly, it is desirable to provide an 
independent analysis of an intellectual property 
portfolio including an independent qualitative or 
quantitative worth indicator of the intellectual 
property portfolio to be acquired. 

In addition, it is also desirable to provide an 
intellectual property audit system that does not depend 
on the owner of the portfolio having previously used 
and marketed the portfolio. 

It is further desirable to provide an intellectual 
property audit system which can be used to determine 
the qualitative and/or quantitative value of the 
intellectual property portfolio in an efficient and 
relatively rapid manner. 

It is also desirable to provide the qualitative 
and/or quantitative value by analyzing the intellectual 
property itself in a mechanized manner as well as 
considering external factors relating to, for example, 
characteristics of the purchasing and selling entities. 

Finally, it is also desirable that the 
intellectual property audit system be provided with the 
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1 ability to output requests for manual assistance to 

2 correct, for example, erroneously entered data or 

3 incomplete or insufficient data causing the 

4 intellectual property audit system to be unable to 

5 completely analyze the input data for determining of an 

6 intellectual property portfolio value. Accordingly, 

7 the audit system permits a user to manually correct or 

8 complete data to permit the audit system to determine a 

9 qualitative and/or quantitative intellectual property 

10 portfolio value. 

11 SUMMARY OF THE INVENTION 

12 It is, therefore, an object of the present 

13 invention to provide an independent analysis of an 

14 intellectual property portfolio including an 

15 independent qualitative or quantitative worth indicator 

16 of the intellectual property portfolio to be acquired. 

17 It is also an object of the present invention to 

18 provide an intellectual property audit system that does 

19 not depend on the owner of the portfolio having 

20 previously used and marketed the portfolio. 

21 It is also an object of the present invention to 

22 provide an intellectual property audit system which can 

23 be used to determine the qualitative and/or 

24 quantitative value of the intellectual property 

25 portfolio in an efficient and relatively rapid manner. 

26 It is another object of the present invention to 

27 provide the qualitative and/or quantitative value by 

28 analyzing the intellectual property itself in a 

29 mechanized manner as well as considering external 

30 factors relating to, for example, characteristics of 
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the purchasing and selling entities. 

Further, it is an object of the present invention 
that the intellectual property audit system be provided 
with the ability to output requests for manual 
assistance to correct, for example, erroneously entered 
data or incomplete or insufficient data causing the 
intellectual property audit system to be unable to 
completely analyze the input data for determining of an 
intellectual property portfolio value. Accordingly, 
the audit system permits a user to manually correct or 
complete data to permit the audit system to determine a 
qualitative and/or quantitative intellectual property 
portfolio value. 

To achieve these and other objects, the present 
invention provides an intellectual property computer- 
implemented audit system for valuing an intellectual 
property portfolio. The intellectual property audit 
system includes a first database storing first 
information relating to the intellectual property 
portfolio and a database access and collection device 
connected to the first database and accessing the first 
database and retrieving the first information. In 
addition, the intellectual property audit system also 
includes a second database storing empirical data 
relating to known intellectual property portfolios, and 
a comparison device connected to the database access 
and collection device and to the second database, the 
comparison device receiving the first information from 
the database access and collection device and comparing 
the first information to the empirical data retrieved 
from the second database producing an intellectual 
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1 property worth indicator indicating the worth of the 

2 intellectual property portfolio. 

3 These together with other objects and advantages 

4 which will be subsequently apparent, reside in the 

5 details of construction and operation as more fully 

6 hereinafter described and claimed, with reference being 

7 had to the accompanying drawings forming a part hereof, 

8 wherein like numerals refer to like elements 

9 throughout . 

10 BRIEF DESCRIPTION OF THE DRAWINGS 

11 Fig. 1 is a detailed block diagram of the 

12 structure of the intellectual property audit system of 

13 the present invention; 

14 Fig. 2 is a conceptual diagram of the intellectual 

15 property audit system of the present invention; 

16 Figs, 3-6 are block diagrams illustrating 

17 additional embodiments of the pattern matching system; 

18 Fig. 7 is an illustration of a main central 

19 processing unit for implementing the computer 

20 processing; 

21 Fig. 8 is a block diagram of the internal hardware 

22 of the computer illustrated in Fig. 7; and 

23 Fig. 9 is an illustration of an exemplary memory 

24 medium which can be used with disk drives illustrated 

25 in Fig. 7 or Fig. 8. 

26 NOTATIONS AND NOMENCLATURE 

27 The detailed descriptions which follow may be 

28 presented in terms of program procedures executed on a 

29 computer or network of computers. These procedural 
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descriptions and representations are the means used by 
those skilled in the art to most effectively convey the 
substance of their work to others skilled in the art. 

A procedure is here, and generally, conceived to 
be a self -consistent sequence of steps leading to a 
desired result. These steps are those requiring 
physical manipulations of physical quantities. 
Usually, though not necessarily, these quantities take 
the form of electrical or magnetic signals capable of 
being stored, transferred, combined, compared and 
otherwise manipulated. It proves convenient at times, 
principally for reasons of common usage, to refer to 
these signals as bits, values, elements, symbols, 
characters, terms, numbers, or the like. It should be 
noted, however, that all of these and similar terms are 
to be associated with the appropriate physical 
quantities and are merely convenient labels applied to 
these quantities. 

Further, the manipulations performed are often 
referred to in terms, such as adding or comparing, 
which are commonly associated with mental operations 
performed by a human operator. No such capability of a 
human operator is necessary, or desirable in most 
cases, in any of the operations described herein which 
form part of the present invention; the operations are 
machine operations. Useful machines for performing the 
operation of the present invention include general 
purpose digital computers or similar devices. 

The present invention also relates to apparatus 
for performing these operations. This apparatus may be 
specially constructed for the required purpose or it 
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may comprise a general purpose computer as selectively 
activated or reconfigured by a computer program stored 
in the computer. The procedures presented herein are 
not inherently related to a particular computer or 
other apparatus. Various general purpose machines may 
be used with programs written in accordance with the 
teachings herein, or it may prove more convenient to 
construct more specialized apparatus to perform the 
required method steps. The required structure for a 
variety of these machines will appear from the 
description given. 

DESCRIPTION OF THE PREFERRED EMBODIMENT 

The intellectual property audit system according 
to the present invention may be used as an integrity 
check for acquisitions having assets involving a 
substantial intellectual property portfolio. The 
system could be used to compare the intellectual 
property portfolio to be acquired with other 
intellectual property portfolios having known market 
values to obtain an indicator of the intellectual 
property portfolio's worth. Depending on the quality 
of empirical data, the intellectual property audit 
system of the present invention could provide a 
qualitative and/or quantitative analysis of an. 
intellectual property portfolio which is to be 
acquired. 

Fig. 2 is a conceptual diagram of the intellectual 
property audit system of the present invention. The 
intellectual property audit system would operate in 
general terms as follows. In Fig. 2, the user of the 
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1 audit system would input the appropriate data to an 

2 input device 14. In the most basic form, the input 

3 data might simply be a list of patent numbers for an 

4 intellectual property portfolio comprising only 

5 patents . 

6 Next, the data would be transmitted to a database 

7 access device 16 which would collect various data from 

8 different on-line intellectual property databases 18. 
^ 9 The collected data represents different intellectual 

tfj 10 property worth indicators . Each worth indicator would 

*f\ 11 then be assigned a value which would be approximated 

t fl 12 using previously collected indicator values which are 

13 based on intellectual property portfolios which have 

14 known worth or dollar values by consulting empirical 
s 15 database 22. For example, the audit system could 

16 access a full text patent database such as DIALOG to 

ftj 17 analyze the listed patents. Specifically, the system 

18 could determine how many claims, dependent and 

u 19 independent, are in each patent. A high value would be 

20 assigned to this indicator is there are many claims, 

21 indicating that the drafter or owner of the patent 

22 considered the patent of significant importance. 

23 Additionally, the number of references cited or number 

24 of classes searched could also be gathered and a high 

25 value assigned to the indicator when the patent lists 

26 many classes or many cited references. The rationale 

27 for the high value would be that there is reason to 

28 believe that the examiner performed a detailed 

29 examination leaving the issued patent strong. Further, 

30 each patent could also be searched to determine how 

31 often the patent itself has been cited as a reference 
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1 on other patents. Higher values would be assigned to a 

2 patent cited more often than not, indicating that the 

3 patent was perhaps a breakthrough in a particular 

4 field. 

5 Similar information could also be collected for 

6 trademarks which are included in the intellectual 

7 property portfolio from such databases as DIALOG'S 

8 FEDERAL TRADEMARK SCAN and STATE TRADEMARK SCAN which 

9 inventory federal and state trademarks, respectively. 

10 Based upon these databases, a user might, for example, 

11 determine whether a trademark includes disclaimers to 

12 certain words in the trademark and/or how many classes 

13 the trademark has been issued for or covers . In 

14 addition, ORBIT'S LEGAL STATUS database includes recent 

15 information affecting the trademark, and LEXIS' NEXIS 

16 database could be used to determine any recent 

17 information relating to the trademark which has been 

18 published in trade magazines or newspapers. 

19 Finally, intellectual property which also includes 

20 copyrighted work could also be considered in a similar 

21 manner. For example, computer software related 

22 intellectual property might include both patents on the 

23 computer system as well as copyrights on the software 

24 itself. 

25 Once all the worth indicators have been 

26 determined, they are transmitted to an indicator 

27 comparing device 20 which would compare the collection 

28 of worth indicators to known collections of worth 

29 indicators from known intellectual property portfolios 

30 stored in empirical database 22. Known distribution or 

31 estimation techniques could be used to determine which 
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known intellectual property portfolio the intellectual 
property portfolio which is to be acquired matches the 
closest. Finally, the system would output the known 
portfolio worth value 24 for which the portfolio to be 
acquired matches the most, signifying a rough 
approximation of the worth of the portfolio to be 
acquired. A detailed description of the intellectual 
property audit system according to the present 
invention follows. 

Fig. 1 represents a block diagram representation 
of the proposed intellectual property audit system. In 
Fig. 1, data input device 2 is used to input the 
necessary data representing the intellectual property 
portfolio to be acquired. This data may be, for 
example, simply the list of patent numbers in the 
portfolio, or the data might include additional 
information relating to the specific intellectual 
property portfolio or the selling/acquiring entities 
which might not be readily retrievable from current 
databases. For example, the additional information 
might include financial information regarding the 
selling/acquiring entities or recent performance in the 
stock market. Data input device 2 is a standard input 
device and may include, for example, the data entry 
system in U.S. Patent 4,012,720 or the data entry / 
interface assembly in U.S. Patent 4,638,422, 
incorporated herein by reference. In any event, the 
data is entered and then transmitted to database access 
and collection device 4 . 

Database access and collection device 4 filters 
the received data to determine which aspects of the 
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received data are to be further analyzed by retrieving 
information regarding the data from various on-line 
databases. For example, database access and collection 
device 4 would determine that the received patent 
numbers should be used to analyze the patents while the 
received financial data might not be further analyzed 
and simply transmitted to data processor 6 for later 
evaluation. 

With respect to the data which is to be analyzed, 
database access and collection device 4 will access the 
various databases having information concerning the 
data to be analyzed and collect the necessary 
information regarding the data. For example, with 
respect to the patent number information, database 
access and collection device 4 would access the ORBIT 
database to determine if the patent is currently 
involved in a pending litigation using such databases 
as the LITALERT Database, or whether the patent is 
under reexam or reissue using such ORBIT databases as 
LEGAL STATUS or PATENT STATUS. Database access and 
collection device 4 could also access the LEXIS/NEXIS 
database to determine whether any newspapers have 
published any current information regarding the patents 
as well as determining whether the patent has been 
involved in previous lawsuits by accessing the legal 
reporter files. 

Finally, database access and collection device 4 
can also access a full-text patent database such as 
DIALOG to either collect the necessary information 
directly from DIALOG or to obtain the patents 
themselves. The types of patent information which 
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would provide important information could be of two 
forms. The first type of information would be patent 
information derived directly from the patents. Such 
information would include number of claims, the length 
of the independent claims, number of references cited, 
number of classes searched, whether the patent is a 
reissue or reexam, number of years until patent expires 
or in which group the patent was examined. In 
addition, the indicators may include whether the 
inventor (s) is a U.S. or foreign citizen, or whether 
the current owner is U.S. or foreign based. Further, 
information regarding U.S. or foreign priority, and 
whether the cited references have publication dates 
near the priority dates could also be considered. 

The second type of information would not be 
derived from the patent itself, but would be 
information derived from other patents. For example, 
this information might be how often the patent being 
acquired has been cited as a reference for other 
patents . 

Similar information could also be collected for 
trademarks which are included in the intellectual 
property portfolio from such databases as DIALOG'S 
FEDERAL TRADEMARK SCAN and STATE TRADEMARK SCAN which 
Inventory federal and state trademarks, respectively. 
Based upon these databases, a user might, for example, 
determine whether a trademark includes disclaimers to 
certain words in the trademark and/or how many classes 
the trademark has been issued for or covers . In 
addition, ORBIT' s LEGAL STATUS database includes recent 
information affecting the trademark, and LEXIS' NEXIS 
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1 database could be used to determine any recent 

2 information relating to the trademark which has been 

3 published in trade magazines or newspapers. 

4 Finally, intellectual property which also includes 

5 copyrighted work could also be considered in a similar 

6 manner. For example, computer software related 

7 intellectual property might include both patents on the 

8 computer system as well as copyrights on the software 

9 itself. 

10 Database access and collection device 4 may be any 

11 standard device which may interface with the various 

12 other databases using, for example, software which is 

13 able to mimic or compatible with the software systems 

14 of the various databases. Accordingly, database access 

15 and collection device 4 may include, for example, the 

16 data collection system in U.S. Patent 3,810,101 or the 

17 information retrieval system in U.S. Patent 4,064,490, 

18 incorporated herein by reference. Additionally, 

19 database access and collection device 4 may also 

20 include, for example, the machine translation system in 

21 U.S. Patent 4,814,988 or the computer method for 

22 automatic extraction of commonly specified information 

23 from business correspondence in U.S. Patent 4,965,763, 

24 incorporated herein by reference. 

25 The collected information, including, for example, 

26 the first and second types of patent information 

27 discussed above, are then transmitted to data processor 

28 6 to process the collected data. The data which does 

29 not require processing in data processor 6 may be 

30 simply passed to indicator weighing device 8. Data 

31 processor 6 would then process the collected data as 
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follows: For each of the above indicators, data 
processor 6 would assign an importance factor, based 
upon predetermined data stored in empirical database 
12, for each of the indicators indicating the 
importance of the collected data with respect to each 
indicator. Data processor 6 may include any standard 
data processor such as the 386 data processor 
manufactured by various companies including Intel and 
may include the various functions of the artificial 
intelligence system in 4,6707848, incorporated herein 
by reference. 

Empirical database 12 may be a single database 
storing all the required empirical data, or empirical 
database 12 may be comprised of several smaller 
databases each storing different required data used by 
the intellectual property audit system of the present 
invention. Empirical database may be any standard 
database and may include, for example, the data storage 
and processing apparatus in U.S. Patent 3,911,403, 
incorporated herein by reference. 

For example, if data access and collection device 
4 searched the DIALOG database and collects information 
that a specific patent has been cited over 100 times, 
i.e., a citation indicator, data processor 6 might 
assign an importance factor of 10 on a scale of 1 to 10 
to the citation indicator. Similarly, if database 
access and collection device 4 determines that the 
patent was searched in only one class for the class 
indicator, data processor 6 might assign a 1 on a scale 
of 1 to 10 to the class indicator. Note that 
currently, both the class and citation indicators have 
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1 the same relative importance. As discussed above, data 

2 processor 6 determines the 10 value for the citation 

3 indicator and the 1 value for the class factor by 

4 comparing the indicators to predetermined indicators 

5 having predetermined values. These predetermined 

6 indicators are based upon collected known indicators 

7 from known intellectual property portfolios. 

8 The determined worth indicators are then 

9 transmitted to an indicator weighing device 8 which 

10 prioritizes each of the indicators against each other 

11 based upon predetermined weighing schemes which have 

12 been determined from known portfolios by also 

13 consulting empirical database 12. For example, the 

14 citation indicator may be more important, for example 

15 twice as important, than the class indicator based upon 

16 predetermined experience. 

17 The weighted indicators are transmitted to 

18 indicator comparing device 10 which compares the 

19 collection of worth indicators to known collections of 

20 worth indicators from known intellectual property 

21 portfolios by consulting database 12 storing the 

22 empirical data. Known distribution or estimation 

23 techniques could be used to determine the closest 

24 matching known intellectual property portfolio to the 

25 intellectual property portfolio which is to be 

26 acquired. Finally, the system would output the known 

27 value for which the portfolio to be acquired matches 

28 the most signifying a rough approximation of the worth 

29 of the portfolio to be acquired. The output may be 

3 0 displayed on any display, such as the display systems 

31 for electronic data processing equipment in U.S. Patent 



Docket: 2423-002B - 20 - PATENT APPLICATION 

3,820,080, incorporated herein by reference. 

In addition to the above features, the present 
invention also includes the feature of manual 
assistance processing in the event the process of the 
present invention fails for known or unknown reasons. 
Accordingly, when a failure occurs, a notice is 
generated to a predetermined location where manual 
assistance may be performed. Each of data input device 
2, database access and collection device 4, data 
processor 6, indicator weighing device 8, indicator 
comparing device 10 are programmed to output manual 
assistance requests to different locations or the same 
location depending on whether the manual work force 
must be spread over more than one location. Thus, the 
present invention is also able to effectively correct 
failures in the processing of the intellectual property 
portfolio in order that the process continue to 
determine an intellectual property portfolio worth 
indicator. Thus, manual assistance may be performed 
for different aspects of the processing, and the 
processing may be restarted in a standard manner for 
the determination of the intellectual property worth 
indicator . 

Advantageously, the present invention also 
utilizes comparison techniques using neural network 
pattern matching processes. The specific types of 
pattern matching techniques implemented by the 
comparison system/device which have already been tested 
and shown to provide excellent results are the standard 
Kohonan and the Back Propagation neural networks, see, 
for example, U.S. Patents 5,146,541 and 5,303,330, 
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incorporated herein by reference. However, other 
pattern matching techniques could also be used, 
depending on the required application. In each type of 
comparison, a neural network is selected that is 
suitable to the requirements of the application. The 
Kohonan and Back Propagation networks are discussed 
below. 

The Kohonan neural network is useful in grouping 
similar patterns. The primary benefit of the Kohonan 
neural net as the basis for finding duplicate 
information is that it does not require training. The 
neural network is built and the categories are created 
as the entries are provided to the Kohonan neural net. 
When a Kohonan neural network was used with the Neural 
Pattern described earlier in connection with Fig. 18, 
effective results are provided for small population 
sizes . 

In a Kohonan neural network each entry is fed into 
the network and a value is returned. By keeping track 
of the output numbers, entries with similar output 
numbers are grouped as similar. One disadvantage of 
the Kohonan neural network is that it may be moderately 
slow and is somewhat ineffective using large 
populations of entries. That is, as the number of 
entries in the system increases, its ability to 
effectively group similar data decreases. 

The Back Propagation neural network is a trainable 
network. Using this method the entries are fed into 
the network in a first pass, which creates the neural 
pattern. Then a second pass is made with each entry to 
determine which values are to be compared. The second 
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1 pass thereby indicates to what extent the current entry 

2 matches any of the entries in the population. There is 

3 a guarantee that the entry will at least match on 

4 itself within the population. 

5 The Back Propagation network is created by 

6 creating an input /output array with as many slots as 

7 there are entries. For example, if there are 2000 

8 entries in the population, then a 2000 slot 

9 input /output array is created. For each entry that is 

10 entered into the Back Propagation network, a second 

11 array is provided indicating which entry in the 

12 sequence it is. The first "entry i n the input/output 

13 array is indicated by setting the first slot to 1 and 

14 the remaining slots to 0. The second entry is 

15 indicated by setting the second slot to 1 and the 

16 remaining slots to 0, and so on. 

17 When the entire Back Propagation network is 

18 trained with the entries, a second pass is made to 

19 evaluate each entry against the population. In the 

20 evaluation phase, each entry is passed through the 

21 network along with an empty input /output array. The 

22 Back Propagation network fills in the array with a 

23 value between 0 and 1 for each slot. A value of 1 

24 indicates an exact match and a value of 0 indicates no 

25 match whatsoever. By scanning the input /output array 

26 for each entry in this manner, a table can be built of 

27 each entries comparative value with all the entries in 

28 the population. Any threshold can be set to consider a 

29 match relevant as potential duplicate or fraudulent 

30 data. For example, a .5 can be considered a relevant 

31 match. In this case if an entry matches any other with 
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1 a value of . 5 or greater, it is considered a potential 

2 duplicate. 

3 The advantages of the Back Propagation network are 

4 that it provides a relative ranking of entries and 

5 their matches with other entries in a population, and 

6 that it can easily be extended to other types of 

7 comparison-related applications. As compared with the 

8 Kohonan, this neural net method provides a value that 

9 indicates the extent one entry matches another. This 

10 can be used to provide different thresholds for 

11 indicating a match. This method can also be used for a 

12 wide variety of comparison-related problems . In cases 

13 where a match on similar values is required without 

14 necessarily grouping items, this method can be used as 

15 opposed to the Kohonan. For example, in many companies 

16 there is a need to find employees that are acting as 

17 vendors to the company, since this is likely a conflict 

18 and may potentially be the basis of fraud. However, 

19 the name, address, social security number or other 

20 information of how the employee is registered as a 

21 vendor will likely be varied from the way the employee 

22 is registered as an employee (e.g., in the human 

23 resource system) . To find such conflicts a Back 

24 Propagation network can be built using the entries of 

25 the human resource system, i.e., the employee database. 

26 Then, each entry of the vendor database can be used to 

27 find whether there is a relative match in the employee 

28 database. Since the entries are translated into one of 

29 the neural -based patterns, the Binomial neural network 

30 will identify similar entries and match on employees 

31 that have slightly altered their identification as 
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1 vendors in the vendor system. 

2 Kohonan and Back Propagation Neural Networks are 

3 standard and may be implemented by, for example, 

4 NEUROWINDOWS : Neural Network Dynamic Link Library, 

5 manufactured by Ward Systems Group, Inc., the manual of 

6 which is incorporated herein by reference. Similar 

7 networks are also disclosed, for example, in Caudill, 

8 M. , The Kohonan Model, Neural Network Primer, AI 

J% 9 Expert, 1990, 25-31; Simpson, P., Artificial Neural 

SO 10 Systems, New York, NY, Pergamon Press, 1990; Wasserman, 

11 P., Neural Computing Theory and Practice, New York, NY, 

LiJ 12 Van Nostrand Reinhold, 1989; Specht D. and Shapiro, P., 

fn 

~ s ; 13 Generalization Accuracy of Probalisitic Neural Networks 

f" 

s 14 Compared With Back-Propagation Networks, Proceedings of 

15 the International Joint Conference on Neural Networks, 

X 16 July 8-12, 1991, 1, 887-892, all of which are 

Ul 17 incorporated herein by reference. 

18 Figs. 3-6 are block diagrams illustrating 

19 additional embodiments of the pattern matching system. 

20 In these embodiments, the hardware configuration is 

21 arranged according to the multiple instruction multiple 

22 data (MIMD) multiprocessor format for additional 

23 computing efficiency. Fig. 3 uses a more distributed 

24 database approach, whereas Fig. 4 uses a central 

25 database. Fig. 5 uses a similar approach across a 

26 public switched telephone network, and Fig. 6 uses a 

27 distributed approach where the different systems are 

28 cross coupled in a standard fashion. The details of 

29 this form of computer architecture are disclosed in 

30 greater detail in, for example, U.S. Patent No. 
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5,163,131, Boxer, A., Where Buses Cannot Go, IEEE 
Spectrum, February 1995, pp. 41-45; and Barroso, L.A. 
et al . , RPM: A Rapid Prototyping Engine for 
Multiprocessor Systems, IEEE Computer February 1995, 
pp. 26-34, all of which are incorporated herein by 

reference. ^ C o w ^ Z \S 

Fig. 7 is an illustration of ma-in central 
parocee-si-ng unite 10 for implementing the computer 
processing in accordance with one embodiment of the 
present invention. In Fig. 7, computer system 218 
includes central processing unit 234 having disk drives 
236 and 238. Disk drive indications 236 and 238 are 
merely symbolic of the number of disk drives which 
might be accommodated in this computer system. 
Typically, these would include a floppy disk drive such 
as 236, a hard disk drive (not shown either internally 
or externally) and a CD ROM indicated by slot 238. The 
number and type of drives varies, typically with 
different computer configurations. The computer 
includes display 240 upon which information is 
displayed. A keyboard 242 and a mouse 244 are 
typically also available as input devices via a 
standard interface. 

Fig. 8 is a block diagram of the internal hardware 
of the computer 218 illustrated in Fig. 7. As 
illustrated in Fig. 8, data bus 248 serves as the main 
information highway interconnecting the other 
components of the computer system. Central processing 
units (CPU) 250 is the central processing unit of the 
system performing calculations and logic operations 
required to execute a program. Read-only memory 252 
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and random access memory 254 constitute the main memory 
of the computer, and may be used to store the 
simulation data. 

Disk controller 256 interfaces one or more disk 
drives to the system bus 248. These disk drives may be 
floppy disk drives such as 262, internal or external 
hard drives such as 260, or CD ROM or DVD (digital 
video disks) drives such as 258. A display interface 
264 interfaces with display 240 and permits information 
from the bus 248 to be displayed on the display 240. 
Communications with the external devices can occur on 
communications port 266. 

Fig. 9 is an illustration of an exemplary memory 
medium which can be used with disk drives such as 262 
in Fig. 8 or 236 in Fig. 7. Typically, memory media 
such as a floppy disk, or a CD ROM, or a digital video 
disk will contain, inter alia , the program information 
for controlling the computer to enable the computer to 
perform the testing and development functions in 
accordance with the computer system described herein. 

Finally, it should be noted that the various steps 
of the present invention are performed in hardware. 
Accordingly, each step of the present invention 
typically generates an electrical signal which 
represents a result of a specific step performed by 
each of the above elements in Figs. 1 and 2. 
Accordingly, the above discussion represents the 
electrical signals which are generated and used in the 
various procedures of the present invention. 

The many features and advantages of the invention 
are apparent from the detailed specification, and thus, 



Docket: 2423-002B - 27 - PATENT APPLICATION 



1 it is intended by the appended claims to cover all such 

2 features and advantages of the invention which fall 

3 within the true spirit and scope of the invention. 

4 Further, since numerous modifications and variations 

5 will readily occur to those skilled in the art, it is 

6 not desired to limit the invention to the exact 

7 construction and operation illustrated and described, 

8 and accordingly, all suitable modifications and 

9 equivalents may be resorted to, falling within the 

10 scope of the invention. 

11 What is claimed is: 



